TAICAR-The Collection and Annotation of an In-Car Speech Database Created in Taiwan

نویسندگان

  • Hsien-Chang Wang
  • Chung-Hsien Yang
  • Jhing-Fa Wang
  • Chung-Hsien Wu
  • Jen-Tzung Chien
چکیده

This paper describes a project that aims to create a Mandarin speech database for the automobile setting (TAICAR). A group of researchers from several universities and research institutes in Taiwan have participated in the project. The goal is to generate a corpus for the development and testing of various speech-processing techniques. There are six recording sites in this project. Various words, sentences, and spontaneously queries uttered in the vehicular navigation setting have been collected in this project. A preliminary corpus of utterances from 192 speakers was created from utterances generated in different vehicles. The database contains more than 163,000 files, occupying 16.8 gigabytes of disk space.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Subspace-Based Speech Enhancement with Perceptual Filterbank and SNR-Aware Technique

In this paper, a new subspace-based speech enhancement algorithm is presented. First, we construct a perceptual filterbank from psycho-acoustic model and incorporate it with the subspace-based enhancement approach. This filterbank is created through a five-level wavelet packet decomposition. Next, the prior SNR of each critical band are taken to decide the attenuation factor of the optimal line...

متن کامل

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract   Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

Multiband Subspace Tracking Speech Enhancement for In-Car Human Computer Speech Interaction

In this paper, a new subspace-based speech enhancement algorithm for in-car human computer speech interaction is presented. We first incorporate a perceptual filterbank which is derived from psycho-acoustic model with signal subspace approach to effectively suppress in-car noises of engine. Second, for real-time applications, a new subspace tracking algorithm is derived by modifying PASTd algor...

متن کامل

SPEECHDAT-CAR. A Large Speech Database for Automotive Environments

The aims of the SpeechDat-Car project are to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. As a result, a total of ten (10) equivalent and similar resources will be created. The 10 languages are Danish, each language 600 sessions will be recorded (from at least 300 speakers) in seven characteristic envir...

متن کامل

Fuzzy Neighbor Voting for Automatic Image Annotation

With quick development of digital images and the availability of imaging tools, massive amounts of images are created. Therefore, efficient management and suitable retrieval, especially by computers, is one of themost challenging fields in image processing. Automatic image annotation (AIA) or refers to attaching words, keywords or comments to an image or to a selected part of it. In this paper,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IJCLCLP

دوره 10  شماره 

صفحات  -

تاریخ انتشار 2005